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Cross-Reference to Related Applications 

5 

The present application is related to U.S. Patent Application No. _/ 

(Attorney Docket No. CISCP219) by Wu et al., and titled Methods and Apparatus for 

Transform Coefficient Filtering and U.S. Patent Application No. _/ (Attorney 

Docket No. CISCP234) by Lee et al, and titled Methods and Apparatus for Selecting a 
10 Cut-off Index, both filed on the same day as the present application. Each of the above 
patent applications is incorporated herein by reference for all purposes. 

Bacl^round of the Invention 

15 The present invention relates to rescaling data. More specifically, the present 

invention relates to updating reduction ratios used for rescaling data sequences. Still 
more specifically, the present invention provides techniques for determining reduction 
ratios for modifying transform coefficients associated with an input data sequence (e.g. 
an audio segment or a video sequence) to provide modified transform coefficients 

20 associated with a modified output data sequence. 

Video data is one particularly relevant form of data that can benefit from 
improved techniques for rescaling. Generally, compressing data or fiirther 
compressing compressed data is referred to herein as rescaling data. Video rescaling 

25 schemes allow digitized video fl-ames to be represented digitally in an efficient manner. 
Rescaling digital video makes it practical to transmit the compressed signal by digital 
channels at a fraction of the bandwidth required to transmit the original signal without 
compression. International standards have been created on video compression schemes. 
The standards include MPEG-1, MPEG-2, MPEG-4, H.261, H.262, H.263, H.263+, 

30 etc. The standardized compression schemes mostly rely on several key algorithm 
schemes: motion compensated transform coding (for example, DCT transforms or 
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wavelet/sub-band transforms), quantization of the transform coefficients, and variable 
length coding (VLC). 

The motion compensated encoding removes the temporally redundant 
5 information inherent in video sequences. The transform coding enables orthogonal 
spatial frequency representation of spatial domain video signals. Quantization of the 
transformed coefficients reduces the number of levels required to represent a given 
digitized video sample and reduces bit usage in the compression output stream. The 
other factor contributing to rescaling is variable length coding (VLC) that represents 
10 frequently used symbols using code words. In general, the number of bits used to 
represent a given image determines the quality of the decoded picture. The more bits 
used to represent a given image, the better the image quality. The system that is used 
to compress digitized video sequence using the above described schemes is called an 
encoder or encoding system. 

15 

More specifically, motion compensation performs differential encoding of 
frames. Certain frames, such as I-fi:ames in MPEG-2, continue to store the entire 
image, and are independent of other frames. Intracoded frames, such as B-frames or P- 
frames in MPEG-2, store motion vectors associated with the movement of particular 

20 objects. The pixel-wise difference between objects is called the error term, which can 
be stored in P-frames and B-frames. In MPEG-2, P-frames reference a single frame 
while B-frames reference two different frames. Although this allows fairly high 
reduction ratios, motion compensation is limited when significant changes occur 
between frames. This precludes high reduction ratios. Furthermore, motion 

25 compensation can be computationally expensive. 

Each frame can be converted to luminance and chrominance components. As 
will be appreciated by one of skill in the art, the human eye is more sensitive to the 
luminance than to the chrominance of an image. In MPEG-2, luminance and 
30 chrominance frames are divided into 8x8 pixel blocks. The 8x8 pixel blocks are 
transformed using a discrete cosine transform (DCT) and scaimed to create a DCT 
coefficient vector. Quantization involves dividing the DCT coefficients by a scaling 
factor. The divided coefficients can be rounded to the nearest integer. After 
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quantization, some of the quantized elements become zero. The many levels 
represented by the transform coefficients are reduced to a smaller number of levels 
after quantization. With fewer levels represented, more sequences of numbers are 
similar. For example, the sequence 4.9 4.1 2.2 1.9 after division by two and rounding 
becomes 2 2 11. As will be described below, a sequence with more similar numbers 
can more easily be encoded using VLC encoding. However, quantization is an 
irreversible process and hence introduces significant loss of information associated 
with the original frame or image. 

VLC encoding takes the most common long sequences of numbers of bits and 
replaces them with a shorter sequence of numbers or bits. Again, VLC encoding is 
limited by common sequences of numbers or bits. Data containing fewer common 
sequences take more bits to encode. 

Currently available compression techniques for rescaling data (e.g. video or 
audio) are limited in their ability to effectively compress data sequences for 
transmission across networks or storage on computer readable media. The available 
techniques also have significant hmitations with respect to loss, computational 
expense, and delay. Various techniques for reducing the bit rate of compressed data 
sequences including audio and video streams are being developed. Some of the more 
promising approaches are described in U.S. Patent No. 6,181,711 titled System And 
Method For Transporting A Compressed Video And Data Bitstream Over A 
Communication Channel. Other approaches are described in U.S. Patent Application 
No. 09/608,128 Methods And Apparatus For Bandwidth Scalable Transmission Of 
Compressed Video Data Through Resolution Conversion and U.S. Patent Application 
No. 09/766,020 titled Methods For Efficient Bandwidth and U.S. Patent Application 
No. 08/985,377 titled System And Method For Spatial Temporal-Filtering For 
Improving Compressed Digital Video Scaling Of Compressed Video Data. Each of 
these references is assigned to the assignee of this invention and is incorporated herein 
by reference for all purposes. It is still desirable to provide additional techniques for 
rescaling data that improve upon the limitations of the prior art. 
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Summary of the Invention 



According to the present invention, methods and apparatus for updating 
5 reduction ratios are provided. An input bit sequence can be altered to provide modified 
output bit sequence. The input bit sequence may represent information in a portion of 
data such as a video sequence, a picture, or an audio stream. In one example, the bit 
sequence is an MPEG-2 video sequence. The input bit sequence is altered using a 
reduction ratio to provide a modified output bit sequence. The reduction ratio can be 
10 continually updated using rate control information to achieve a target reduction ratio 
for the input bit sequence. 

One aspect of the invention provides a method of altering blocks of transform 
coefficients associated with input bits to provide modified blocks of transform 

15 coefficients associated with output bits. A first block of transform coefficients 
associated with the input bits is identified. The first block of transform coefficients is 
altered by using a reduction ratio to generate a first block of modified transform 
coefficients. An updated reduction ratio is generated. A second block of transform 
coefficients associated v^th the input bits is identified. The second block of transform 

20 coefficients is altered to generate a second block of modified transform coefficients 
using the updated reduction ratio. 

The first block of transform coefficients may be identified by performing 
variable length decoding on the input bits, acquiring the transform coefficients from a 
25 file, performing a DCT operation on video data, or performing a DCT operation on 
audio data. 

Another aspect of the invention provides a method for altering transform 
coefficients associated with macroblocks in a frame having a frame size and a target 
30 reduction ratio. A number of input bits and a number of output bits associated with a 
set of processed macroblocks is identified. The processed macroblocks have altered 
transform coefficients. An updated reduction ratio using the number of input bits and 
the number of output bits associated with the set of processed macroblocks is 
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generated. Transform coefficients of a next macroblock using the updated reduction 
ratio are altered. 



Calculation of the updated reduction ratio can comprise using a spreading 
5 factor, a compensation factor, and a convergence factor. 

Yet another aspect of the invention provides an apparatus for altering transform 
coefficients associated with macroblocks in a frame having a frame size and a target 
reduction ratio. The apparatus comprises several components. A feedback stage is 

10 configured to identify a number of input bits and a number of output bits associated 
with a set of processed macroblocks, the processed macroblocks having altered 
transform coefficients. The feedback stage is further configured to generate an updated 
reduction ratio using rate control information. A filtering stage is coupled to the 
feedback stage and configured to alter transform coefficients of a next macroblock 

1 5 using the updated reduction ratio. 

Another aspect of the invention pertains to computer program products 
including a machine readable medium on which is stored program instructions, tables 
or lists, and/or data structures for implementing a method as described above. Any of 
20 the methods, tables, or data structures of this invention may be represented as program 
instructions that can be provided on such computer readable media. 

A further understanding of the nature and advantages of the present invention 
may be realized by reference to the remaining portions of the specification and the 
25 drawings. 
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Brief Description of the Drawings 



Figures 1 is diagrammatic representation of a system that can use the 
techniques of the present invention, according to specific embodiments. 
5 Figure 2 is a diagrammatic representation of another system that can use the 

tecliniques of the present invention, according to specific embodiments. 

Figure 3A and 3B are graphical representations of numbers, the DCT 
coefficients associated with the numbers, and the IDCT of the DCT coefficients, 
according to specific embodiments. 
10 Figure 4 is a diagrammatic representation of filters using zeros and ones, 

according to specific embodiments. 

Figure 5 is a diagrammatic representation of fihers using threshold values, 
according to specific embodiments. 

Figure 6 is a diagrammatic representation of the application of a filter using 
1 5 ones and zeros and a filter using threshold values, according to specific embodiments. 

Figure 7 is a process flow diagram showing techniques for applying a filter, 
according to specific embodiments, according to specific embodiments. 

Figure 8 is a diagrammatic representation of application of requantization, 
according to specific embodiments. 
20 Figure 9 is a process flow diagram showing techniques for applying 

requantization, according to specific embodiments. 

Figure 10 is a diagrammatic representation of filters and requantization levels 
that can be selected based on a reduction ratio, according to specific embodiments. 

Figure 11 is a process flow diagram showing techniques for selecting a filter 
25 and a requantization level using a reduction ratio. 

Figure 12 is a graphical representation showing the actual achieved reduction 
ratio versus the target reduction ratio for each macroblock of a series of frames. 

Figure 13 is a diagrammatic representation of a system that can be used to 
implement the techniques of the present invention. 

30 
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Detailed Description of Specific Embodiments 



The present invention generally relates to data compression. Data compression 
5 techniques are described generally in The Data Compression Book , by Mark Nelson 
(ISBN: 1558514341), the entirety of which is hereby incorporated by reference for all 
purposes. 

Many techniques for data compression are currently available. One particularly 
10 relevant technique for data compression is MPEG-2. MPEG-2 uses motion 
compensation, discrete cosine transforms, quantization, and variable length coding to 
rescale video data. Many prior art techniques have focused bit rate reduction and 
rescaling schemes on quantization, motion compensation, and variable length 
encoding. The present invention provides techniques for selectively filtering DCT 
15 coefficients to efficiently allow video compression to comply with desired reduction 
ratios while maintaining optimal perceivable image quality. More specifically, the 
present invention allows selective filtering of DCT coefficients associated with a 
macroblock to comply with reduction ratios by using rate control information derived 
fi-om compression filtering and encoding of prior macroblocks. 

20 

Figiu-e 1 is a diagrammatic representation of a system 129 that can use the 
techniques of the present invention. Figure 1 shows a system 129 that couples network 
101 and network 127. According to various embodiments, network 101 has one set of 
constraints while network 127 has a more restrictive set of constraints. For example, 

25 network 101 may allow transmission at a higher bit rate than network 127. A system 
129 receiving encoded content can reduce or rescale the content to allow transmission 
onto network 127. In one example, the bandwidth allocated on a network 101 to a 
particular user is 1 MBps while the bandwidth allocated for transmission on network 
127 for the same user is .8 MBps. A real-time video stream transmitted fi-om one 

30 network to another may benefit from improved techniques for rescaling the video 
stream to comply with the more restrictive constrains of network 127. 

In common embodiments, system 129 can be part of a network device such as a 
gateway, router, switch, or cable network headend equipment connecting two different 
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networks or networks having different network constraints. According to various 
embodiments, the encoded content is an MPEG bitstream. Note that the invention is 
not hmited to an application to MPEG compression, or even the video compression 
techniques generally. Rather the invention is applicable to any type of content in 
5 which transform coefficients are used to represent portions of content. Furthermore, 
the coefficients can be selected based upon the type of untransformed content they 
represent (e.g. high frequency vs. low-frequency spatial features of an image or an 
audio sample). 

10 For convenience, the invention will be described in the context of MPEG-2 

compression and bit rate reduction in an MPEG-2 video stream. The size of an MPEG 
bitstream can be reduced by filtering the transform coefficients in each MPEG frame. 
A system 129 can then apply a reduction ratio of .8 to the encoded content by filtering 
coefficients. The output bitrate divided by the input bitrate is herein referred to as a 

15 reduction ratio. It should be noted that filtering includes altermg coefficients, zeroing 
coefficients, setting a coefficient string to a particular sequence, or generally changing 
transform coefficients in block to allow an effective rescaling ratio. As will be 
appreciated by one of skill in the art, the bitstream is partially decoded before the 
transform coefficients are altered or filtered. The techniques of the present invention 

20 allow rescaling of a data sequence without complete decoding of the data sequence. 
According to various embodiments, rescaling the video stream does not involve 
computationally expensive inverse transform operations. Variable length decoding 
stage 103 receives the MPEG encoded bitstream and applies variable length decoding 
to extract a block 105. Block 105 typically represents a portion of a frame of MPEG 

25 video. 

As will be appreciated by one of skill in the art, the basic structxire for a coded 
video frame or picture is a block that is an 8 pixel by 8 pixel array. Multiple blocks 
form a macroblock, which in turn form part of a slice. In one embodiment, a block is a 
30 macroblock. A coded frame consists of multiple shces. Multiple coded frames form a 
group of frames. Such hierarchical layering of data structures localizes the most basic 
processing on the lowest layer, namely blocks and macroblocks. 
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As noted above, each block contains variable length codes for DCT 
coefficients. In the MPEG-2 syntax, the picture data section contains the bulk of the 
compressed video images. This is where the DCT coefficients are encoded as variable 
length codes. For a typical bitstream, this portion of the data takes somewhere between 
5 70%-90% of the total bit usage of a coded picture, depending on the coded bit rate. 

The access unit level information relates to coded pictures and may specify 
whether a picture is an intra frame (I-frame), a predicted frame (P-frame), or a bi- 
directional frame (B-frame). An I-frame contains full picture information. A P-frame 
10 is constructed using a past I-frame or P-frame. A bi-directional frame (B-frame) is bi- 
directionally constructed using both a past and a future I-frame or P-frame. I-frames 
can be referred to as anchor frames. 

Each video frame can be represented by luminance and chrominance pixels. 

15 The techniques of the present invention apply regardless of the type of frame or the 
type of pixel. Block 105 contains transform coefficients that roughly correspond to 
frequency information contained in the video block. Block 105 has low- frequency 
fransform coefficients 107 and higher frequency transform coefficients 109. Although 
the transform coefficients in block 105 do not correspond exactly to frequency 

20 information contained in a portion of the frame of MPEG video, the coefficients 
provide general information on the various types of frequency information in the 
portion of the frame. Block 105 is passed to a filtering stage 111. Filtering stage 111 
is optionally coupled with requantization stage 113. Both filtering at filtering stage 
1 1 1 and requantization at requantization stage 113 can be used to reduce the bandwidth 

25 requirements of the MPEG encoded bitstream. Filtering stage 111 can be used to 
selectively filter transform coefficients. 

According to various embodiments, filtering transform coefficients can 
comprise zeroing the transform coefficients or setting the transform coefficients to a 
30 particular sequence of numbers. In Figure 1, transform coefficients 123 of block 117 
are selected for filtering. Block 117 of modified transform coefficients comprises 
lower frequency transform coefficients 121 and higher frequency transform 
coefficients 123. Often frequency can be used as a filtering criteria for the coefficients. 
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Of course, other criteria such as computational convenience, etc. can be used to select 
coefficients for filtering. Various selection and filtering criteria will be described 
further below. 

5 Certain embodiments of the present invention select the higher frequency 

components 123 of the modified transform coefficient block 117 for filtering. Of 
course there may be other frequency selection criteria that can be used. Requantization 
stage 1 13 can also be used to zero transform coefficients 1 17 or to reduce the number 
of levels represented by a block 117. As will be appreciated by one of skill in the art, a 
10 block of modified transform coefficients 117 containing fewer levels and more zeroes 
can be efficiently variable length coded at VLC recoding stage 125. The modified 
transform coefficient block 117 can be encoded as a reduced output bitstream. The 
output bitstream can be provided to network 127. 

15 Rate control stage 115 monitors the number of input bytes and the number of 

output bytes along lines 131 and 133 respectively. Rate control stage 115 can use 
information about the number of input and output bytes for prior filtered blocks of data 
to provide rate control information for a current block. Rate control information can be 
provided to filtering stage 111 and to requantization stage 113 to allow control over 

20 rescaling. Information provided by rate control stage 115 can be used by filtering stage 
111 and requantization stage 113 to determine specifically how transform coefficients 
will be altered. According to various embodiments rate control information is provided 
by rate control stage 1 15 for each macroblock. 

25 As will be appreciated by one of skill in the art, each macroblock can comprise 

multiple component blocks. In MPEG-2, various formats may be employed to define 
the luminance and chrominance pixel content at a macroblock. Using the 4:2:0 format, 
a macroblock comprises four 8x8 matrices of luminance coefficients and two 8x8 
matrix of chrominance coefficients. Rate control information can be updated at rate 

30 control stage 1 1 5 after a macroblock has been processed. Of course, other formats can 
use the techniques of the present invention. Alternatively, rate control stage 115 may 
provide rate control information to filtering stage 111 and requantization stage 1 13 on a 
per frame or a per block basis. 
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Although that techniques of the present invention can be used in a network 
node connecting two networks having different bandwddth constraints, the techniques 
of the present invention are more general and can be applied in a variety of different 
5 contexts. For example, instead of receiving an MPEG encoded bitstream from a 
network 101, the MPEG encoded bitstream may be contained in a file that can be 
reduced in size prior to either storage, viewing, or transmission. System 127 can be 
used to reduce the size of a transform encoded file saved on a hard disk, CD, DVD, or 
other media. An MPEG encoded file can be variable length decoded at variable length 
10 decoding stage 103. A block 105 is forwarded to filtering stage 111 and/or 
requantization stage 1 13 to provide a modified block 117. 

According to various embodiments, a block of altered transform coefficients is 
then recoded at VLC encoding stage 125 and provided to output. A rate control stage 

15 115 can provide rate control information to filtering stage 111 and requantization stage 
113 based on the desired file reduction size. Information can be provided to filtering 
stage 111 and requantization stage 113 to allow a determination of how transform 
coefficients are altered or filtered. As noted above, rate control information can be 
provided for filtering on a per macroblock basis. In accordance with the techniques of 

20 the present invention, the inverse transform coding may be performed in a manner 
designed to meet a reduction ratio. 

Figure 2 is a diagrammatic representation of a another system that can use the 
techniques of the present invention. Figure 2 describes a system for using the 

25 techniques of the present invention to initially encode video content. Video data 201 is 
split into video frames 203. Each video fi-ame 203 can be uncompressed data. 
Although Figure 2 is described in the context of video data, one of skill in the art will 
understand that the techniques of the present invention can be applied to other types of 
data such as simple image data (e.g. JPEG) or audio data. Each video frame 203 can be 

30 divided into 16x16 pixel macroblocks. The macroblocks are further separated into 
component blocks of pixels. 
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Block 205 can represent 64 pixels of image data. A transform stage 207 is 
applied to the block 205. According to specific embodiments, the transform stage 207 
is a discrete cosine transform (DCT). The transform stage converts block 205 
representing pixel information to block 209 containing DCT coefficients. The 
5 transform coefficient block 209 can then be quantized at quantization stage 211. Using 
the techniques of the present invention, filtering stage 217 contains mechanisms for 
selectively filtering DCT coefEcients. The filtering stage 217, can contain mechanisms 
for selecting how and how many DCT coefficients to filter in order to obtain desired 
rescaling ratios. According to various embodiments, the coefficient filtering occurs for 
10 various intrablocks. 

As will be appreciated by one of skill in the art, the coefficients in the top left 
region of block 209 roughly correspond to low fi:equency components of block 205. 
The coefficients in the bottom right region of block 209 roughly correspond to high 

15 frequency components of block 205. The human eye is typically more sensitive to low 
frequency components than to high frequency components of an image. By removing 
low frequency components of an image, the edges and comers become more abrupt. 
By removing high frequency components of an image, the edges and comers tend to 
blur. By selectively filtering DCT coefficients of block 209, an image can be 

20 minimally altered while falling within boimds of a reduction ratio. The techniques of 
the present invention allow DCT coefficients to be selectively filtered in order to 
comply with reduction ratios. 

The techniques of the present invention allow filtering stage 207 to dynamically 
25 vary the number of DCT coefficients dropped based on the varying requirements of 
macroblock sequences. For example, a few macroblocks of a particular video frame 
may be particularly easy to compress. This may be due to the fact that the first few 
macroblocks contain mostly smooth areas. Quantization stage 211 and variable length 
and coding 213 are able to compact the information associated with the first he 
30 macroblocks into a small number of output bits in output bitstream 215. The target 
reduction ratio may seem easily achievable based on the first few macroblocks. 
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The filtering stage 217 can selectively filter fewer DCT coefficients. However, 
if the compression of prior macroblocks indicates that the total reduction ratio has not 
been achieved, filtering stage 217 can filter more DCT coefficients. Filtering more 
DCT coefficients tends to decrease the nximber of bits in the output compressed 
5 bitstream 215. 

As noted above, the DCT coefficients correspond to limited ranges of 
frequency information for the block 205. Often frequency can be used as a filtering 
criteria for the coefficients. Of course, other criteria such as computational 

10 convenience can be used to select coefficients based upon their contribution on human 
perception. As noted, the human eye is more sensitive to degradation of low-frequency 
spatial information than the degradation of higher frequency spatial information. 
Therefore, certain embodiments of this invention select the high frequency components 
of coefficient matrices for dropping. Of course there may be other frequency regimes 

15 that could be selected. In one embodiment, a system/method of this invention select 
particular frequency bands for filtering. Alternatively, for applications in edge and line 
detection, low-frequency components can be selected for filtering. 

The results from filtering stage 217 are variable length coded using VLC 
20 encoding at 21 3. The output compressed bitstream is provided at 21 5. 

Although the techniques of the present invention can be used in conjunction 
with all of the techniques described in Figure 2, it should be noted that not all the 
techniques of Figure 2 need to be used. For example, using the techniques of the 
25 present invention for selectively filtering DCT coefficients can allow quantization stage 
211 to be avoided. Avoiding quantization can prevent irrecoverable loss of image 
information. 

Figures 3 A and 3B are graphical representations of information loss when DCT 
30 coefficients are selectively filtered. In Figure 3 A, numerical values 307 are represented 
in graph 300. The values may be pixel luminance values, for example. Numerical 
value 301 in the graph corresponds to numerical value 305 in the list. The DCT 
coefficients for numerical values 307 are DCT coefficients 309. The eight numbers 



CISCP2 1 7/JKW/GKK 



13 



shown in graph 300 are transformed into the eight DCT coefficients 309. Taking an 
inverse discrete cosine transform (IDCT) using the eight< DCT coefficients 309 
produces curve 303 in graph 300. It should be noted that all the information associated 
with numbers 307 is maintained. By using all the DCT coefficients, line 303 
5 corresponds exactly with numbers 307 in graph 300. 

In Figure 3B, a DCT transform is applied to numerical values 317 to produce 
DCT coefficients. However, two high frequency DCT coefficients are filtered to yield 
DCT coefficients 319. As noted above, DCT coefficients can be filtered in order to 
10 comply with desired reduction ratios. Taking the IDCT using the six DCT coefficients 
319 yields curve 313. It should be noted that the curve 313 does not correspond 
exactly with numerical values 317. Curve 313 somewhat approximates the original 
numbers 317. Accordingly, it is typically desirable to filter or alter as few DCT 
coefficients as possible. 

15 

Generally, the number of coefficients designated for filtering is referred to 
herein as the cut-off index. However, a cut-off index can be defined in many ways, 
such as a position index separating coefficients associated with a pass band and 
coefficients associated with a stop band. As will be appreciated by one of skill in the 

20 art, DCT coefficients roughly correspond to frequency components of a particular data 
sequence. The cut-off index can also be referenced as a cut-off frequency. That is, 
frequency components above or below a certain cut-off frequency may be selectively 
filtered. According to other embodiments multiple cut-off indices can be used. The 
DCT coefficients between two cut-off indices can be filtered. In the same way, DCT 

25 coefficients between several cut-off frequencies can be filtered. Multiple cut-off 
indices and cut-off frequencies can allow filtering of DCT coefficients that have the 
least perceivable effect on the original data sequence. Most fundamentally, a cut-off 
index represents a quantity of data that must be removed to meet some bandwidth or 
storage requirements. 

30 

Figure 4 is a diagrammatic representation of filters that can be used to 
selectively drop transform coefficients according to the techniques of the present 
invention. Filters of different strengths can be predetermined and stored. It should be 
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noted that typically filters for MPEG-2 encoded blocks will be 8x8 blocks. However, 
for ease of discussion, the blocks shown in Figure 4 are 4x4 blocks. The filter 
parameters can be as simple as an array of zeros and ones, where a one indicates 
coefficients associated with a pass band and a zero indicates coefficients associated 
5 with a stop band. A transform coefficient from a block corresponding to a one in the 
filter block will be retained, while a transform coefficient corresponding to zero in the 
filter block will be dropped. Filter 401 illustrates a filter that selects three coefficients 
401a, 401b, and 401c for dropping. The other values in the filter are ones indicating 
pass band. It should be noted that filter 401 provides one way of implementing a cut- 
10 off index of 13. 

Filter 403 illustrates an implementation of a pass band filter. Filter 403 
contains ones in the pass band 403b that allow mid-frequency transform coefficients in 
a block to be retained. Low frequency band 403a and high frequency band 403 c 
15 contain zeros that filter high and low frequency transform coefficients in a block. 

Filter 405 is a fiher that is configured to drop half the coefficients associated 
with higher frequency components at a block. Ones are placed in portion 405a 
representing lower frequency components while zeroes are placed in portion 405b 
20 representing higher frequency components. Filter 405 applied to a block of transform 
coefficients retains only the 8 lower frequency coefficients of the transform block. 

It should be noted that although the filter is represented as a two-dimensional 
block, the filter can just as easily be represented as a one-dimensional array. Filter 407 
25 is one representation of filter 405 using a one-dimensional array of ones and zeroes. 

Instead of using one to zeroes, thresholds can be used to determine whether a 
particular coefficient in a transform block should be dropped. Figure 5 is a 
diagrammatic representation of filter blocks using thresholds for determining whether 
30 transform coefficients should be filtered according to various embodiments. As will be 
appreciated by one of skill in the art, transform coefficients in a block roughly 
correspond to frequency information of a portion of video or audio and can have 
different values. Larger values tend to indicate transform coefficients of greater 
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importance. According to various embodiments, if the transform coefficient of a block 
exceeds the corresponding value in a filter, the transform coefficient is retained. 

Filter 501 provides one example of a filter using threshold values. Filter 501 
5 has lower values in portion corresponding to low-frequency coefficients and higher 
values corresponding to high frequency coefficients. That is, applying filter 501 to a 
transform coefficient block would retain most of the low-frequency coefficients 
because most of the lower frequency coefiicients would be higher than the small values 
in portion 501a. Portion 501b in filter 501 contains higher values. Transform 
10 coefficients of a block would only be retained if they had a magnitude greater than the 
corresponding value in the filter 501. That is, high frequency coefficients in a block 
would only be retained if they had sufficient magnitude to exceed the higher values 
contained in portion 501b. 

15 Filter 503 is another embodiment of a filter that can be used to filter transform 

coefficients. Filter 503 has high threshold values in portion 503a. A transform 
coefficient block applying filter 503 would have many low-frequency coefficients 
removed since many low-frequency coefficients do not exceed the threshold values in 
portion 503a. A Filter using threshold values can also be implemented as a one- 

20 dimensional array as shown in filter 505. 

It should be noted that a variety of filters including low pass, high pass, notch, 
comb, and band pass filters can be implemented with the filters using ones and zeroes 
shown in Figure 4, or filters using threshold values shown in Figure 5. 

25 

Figure 6 is a diagrammatic representation of the application of two different 
filters to a block of transform coefficients. Block 601 contains low-frequency 
coefficients 601a and high frequency coefficients 601b, Filter 603 uses ones and 
zeroes to filter the coefficients of block 601. Block 603 contains ones in portion 603a 
30 and zeroes in portion 603b. Applying filter 603 to transform coefficient block 601 
yields block 605. Block 605 contains low-frequency coefficients 605a that correspond 
to low-frequency coefficients 601a. High frequency coefficients 605b are filtered 
because of the zeroes in portion 603b of filter block 603. 
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Filter block 609 uses threshold values to filter coefficients of block 601. 
Coefficients of block 601 exceeding the corresponding value in filter block 609 are 
preserved. Coefficients of block 601 that do not exceed the corresponding value in 
5 filter block 609 are dropped. For example, coefficient 601c in block 601 does not 
exceed the corresponding coefficient value 609c of 60, since 59 is less than 60. 
Consequently, the resulting block 611 has a 0 in value 611c. A filter 603 using ones 
and zeroes and filter 609 using threshold values can yield similarly processed blocks in 
605 and 611. It should be noted how^ever, that the resulting blocks 605 and 611 can be 
10 also quite different for different values. 

Figure 7 is a process flow diagram describing a technique for using 
predetermined filters to select how transform coefficients are filtered. At 701, the next 
block of transform coefficients is received. At 703, a filter for the current block is 

15 identified. The filter selected can be one of those shown in Figure 4 and Figure 5. For 
example, to achieve a target reduction ratio of 50 percent, a filter can be selected at 703 
that is similar to filter 403 of Figure 4. Techniques for selecting a filter are described 
in U.S. Patent Application No. / (Attorney Docket No. CISCP219) by Wu et 

al., and titled Methods and Apparatus for Transform Coefficient Filtering, the entirety 

20 of which is incorporated by reference for all purposes. At 705, the next transform 
coefficient fi-om the block is selected. It is determined at 707 whether to drop the 
transform coefficients based on selected filter. Usuig the filters of Figure 4, if the 
selected transform coefficient corresponds to a one in the selected filter, the transform 
coefficients preserved at 711. If the selected transform coefficient corresponds to a 

25 zero from the selected filter, the transform coefficient is dropped at 709. 

Applying a filter of Figure 5, a determination is made at 705 as to whether the 
selected transform coefficient exceeds a corresponding value in the selected filter. If 
the transform coefficient value exceeds a corresponding value in the selected filter, the 
30 transform coefficient is dropped at 709. If the transform coefficient selected from the 
block at 705 does not exceed the corresponding value of the filter selected at 703, the 
transform coefficient is preserved at 71 1. As will be appreciated by one of skill in the 
art, the process flow can also be easily configured to drop coefficients when transform 
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coefficients do not exceed the corresponding value in the selected filter. In other 
words, the threshold can either be the upper threshold or lower threshold for retaining a 
transform coefficient. At 713, it is determined whether any coefficients remain in the 
block. If coefficients remain, the next transform coefficient from the block is selected 
5 at 705. Otherwise, the next block of transform coefficients is received at 701. As 
noted above, the process of Figure 7 can be implemented using hardware that takes 
advantage of parallel processing. Certain hardware embodiments may be configured to 
apply the filter to all the transform coefficients of a block simultaneously, such as in a 
vector or matrix operation. 

10 

As noted above, filtering blocks of transform coefficients associated with input 
bits is one way of providing modified transform coefficients associated with rate 
reduced output bits. As will be appreciated by one of skill in the art, requantization is 
another way of altering transform coefficients. Figure 8 is a diagrammatic 

15 representation showing requantization applied to a 2x2 block of transform coefficients. 
The 2x2 block of transform coefficients is used to provide an illustrative example. In 
MPEG-2, blocks of transform coefficients are typically 8x8 matrices. Requantization 
is a technique for applying a new quantization scale to a block of transform 
coefficients. Requantization can be used at requantization stage 113 shown in Figure 

20 1. Block 801 shows four transform coefficients 8.3, 4.2, 5.1, .5. At 809, transform 
coefficient block 801 is quantized using a quantization scale of two. 

A quantization scale of two apphed to transform coefficient block 801 yields 
block 803 containing values 4, 2, 3, 0. It should be noted that quantization may be 

25 applied to block 801 before the MPEG encoded bitstream is provided to a network 101 
as shown in Figure 1 . Quantization is typically one of the steps used to MPEG encode 
a video bitstream. System 129 of Figure 1 can be a system that receives the input bits 
associated with block 803. A new quantization scale can be applied to the transform 
coefficient block 803 to fiirther reduce the size of the bitstream associated with 

30 transform coefficient block 803. In other words, a new quantization scale can be 
applied to the coefficient block 803 to rescale the MPEG encoded data. According to 
various embodiments, a new quantization scale of 4 is applied at 811 to the transform 
coefficient block 803. Since block 803 has a current quantization scale of two, 
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applying a quantization scale of four means that the transform coefficients in block 803 
are divided by 2 to yield the transform coefficients in block 805. It should be noted 
that the transform coefficients 2, 1, 2, 0 in block 805 are roughly equivalent to 
transform coefficients 8.3, 4.2, 5.1, .5 of block 801 divided by the quantization scale of 
5 4. 

The transform coefficients of block 805 now represent three levels, specifically 
0, 1, and 2. By contrast, the transform coefficients of block 803 represent four levels, 
specifically 0, 2, 3, and 4. Similarly, the transform coefficients of block 801 represent 

10 four levels. As will be appreciated by one of skill in the art, transform coefficients 
representing fewer levels can more efficiently be variable length coded. Furthermore, 
higher quantization scales typically lead to lower numbers and more zeroes in the 
transform coefficient block. Both of these effects can provide more efficient variable 
length coding. That is, the number of output bits associated with transform coefficient 

15 block 805 will typically be less than the number of input bits associated with transform 
coefficient block 801 because of the higher quantization scale. 

According to other embodiments, a new quantization scale of three is applied at 
813 to block 803. Since block 803 already has a quantization scale of two, applying a 
20 quantization scale of three at 8 13 means that the transform coefficients in block 803 are 
divided by 1.5 or 3/2. The division of the transform coefficients of block 803 by 1.5 
yields the transform coefficients of block 807. It should be noted that the transform 
coefficients in block 807 still represent the same number of levels as transform 
coefficients in block 803. 

25 

Figure 9 is a process flow diagram showing requantization of a transform 
coefficient block. At 901, the next block of transform coefficients is received. At 903, 
a quantization scale for the current block is selected using a reduction ratio. At 905, 
the next transform coefficient fi-om the block is selected. The transform coefficient is 
30 then requantized at 907. It is determined at 909 whether there are any remaining 
coefficients of the block. If there are remaining coefficients, the next transform 
coefficient from the block is selected at 905. If there are no remaining coefficients in 
the block, the next block of transform coefficients is received at 901. It should be 
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noted that although each transform coefficient is serially requantized in Figure 9, 
various chip architectures allow parallel requantization of the transform coefficients. 
According to various embodiments, 64 transform coefficients of an 8x8 MPEG-2 block 
can be requantized in parallel. 

5 

As noted in process 903 in Figure 9 and 703 in Figure 7, a filter and a 
quantization scale for a current block can be selected using a reduction ratio. Figure 10 
is a diagrammatic representation of various filters and quantization scales that can be 
selected using particular reduction ratios. In other words. Figure 10 provides filters 

10 and quantization scales that can be used to apply a reduction ratio to an input bitstream. 
Figure 10 assximes the original quantization scale of a transform coefficient block is 
three, although filters and new quantization scales can be provided for transform 
coefficient blocks with a variety of original quantization scales. The quantization scale 
of MPEG transform coefficients is often contained in or associated with the MPEG 

15 bitstream. As will be understood by one of skill the art, a quantization scale contained 
in or associated with the MPEG bitstream can be updated to reflect the new 
quantization scale after requantization. 

According to various embodiments, the reduction ratio is 75 percent. In other 
20 words, the output bitstream should use 75 percent of the bandwidth used by the input 
bitstream. Alternatively, the number of output bits should be 75 percent of the number 
of input bits. A reduction ratio of 75 percent can be used to select filter and 
quantization block 1001. Filter and quantization block 1001 comprises a filter 
configured to remove four high frequency coefficients in portion 1001b. Filter and 
25 quantization block 1001 also applies a new quantization scale of four. According to 
various embodiments, the new quantization scale can be calculated by dividing the 
original quantization scale of three by the reduction ratio of 75 percent. The original 
quantization scale of three divided by the reduction ratio of 75 percent yields the new 
quantization scale of four. Filter and quantization block 1003 can be identified in the 
30 same way. A new quantization scale of six can be determined by dividing the original 
quantization scale of three by the reduction ratio of 50 percent. The filter can be 
configured to filter half of the transform coefficients of a block. 
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With a reduction ratio of 25 percent, the filter can be configured to preserve 
only 25 percent of the transform coefficients. The new quantization scale of 12 can be 
calculated by dividing the original quantization scale of three by the reduction ratio of 
25 percent. 

5 

It should be noted that although Figure 10 provides 4x4 filters, filter of a wide 
variety of sizes and configurations can be provided. For example, 8x8 filters can be 
used for many transform coefficient blocks of an MPEG encoded bitstream. A one- 
dimensional filter as shown in Figure 4 in Figure 5 can also be used. Although the 
10 percentage of transform coefficients preserved by the filters in Figure 10 is equivalent 
to the reduction ratio, it should be noted that the percentage of transform coefficients 
preserved does not have to be equivalent to the reduction ratio. As will be appreciated 
by one of skill in the art, a variety of filters using threshold values can also be used. 

15 Figure 1 1 is a process flow diagram showing a technique for selecting a filter 

and a quantization scale. The techniques of Figure 1 1 can be used in process 703 in 
Figure 7 and process 903 in Figure 9. At 1 101, a reduction ratio is identified. At 1 103, 
a filter is selected with characteristics matching the reduction ratio. The filter can be 
one of those shown in Figure 10. At 1 103, the new quantization scale is identified. As 

20 noted above, the new quantization scale can be calculated by dividing the original 
quantization scale by the reduction ratio. According to various embodiments, both 
filters and quantization scales can be used to applying a reduction ratio to and input 
bitstream. Filters and quantization scales can be used individually or in unison to 
rescale input bits. 

25 

More detail will not be provided on process 1101 in Figure 11. Specifically, 
more detail will be provided on identifying a reduction ratio for a particular block. As 
noted above, and input bitstream can be video comprising a series of frames. In 
MPEG-2, each fi-ame comprises a plurality of macroblocks. Each macroblock can be 
30 separated into blocks representing luminance and chrominance pixels of the 
macroblock. A target reduction ratio can be set for the video. The target reduction 
ratio generally refers to the desired size of the video divided by the current size of the 
video. If the video is currently 100 MB and the video needs to be reduced to 75 MB, 
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the target reduction ratio would be 75 percent. According to one very simple 
embodiment, the target reduction ratio would be the reduction ratio for every block of 
transform coefficients in the fi-ame. The reduction ratio of 75 percent can be used to 
select a filter and a new quantization scale such as that of block 1001 of Figure 10. 

5 

As will be appreciated by one of skill in the art, however, applying a filter and a 
new quantization scale shown in block 1001 to a block of transform coefficients does 
not necessarily result in an actual reduction ratio of 75 percent. Figure 12 shows the 
actual achieved reduction ratio vs. the target reduction ratio for each macroblock of a 

10 series of frames. The mean and the standard deviation of the resulting actual reduction 
ratio are provided. The values of the horizontal and vertical axes should be divided by 
128 to show the reduction ratio. It should be noted that a target reduction ratio of 75 
percent leads to a relatively wide range of actual reduction ratios. Consequently, the 
present invention provides techniques for adjusting the reduction ratio during the 

1 5 rescaling of a particular input bitstream. 

The present invention contemplates using feedback to allow rescaling of an 
input bitstream to meet a target reduction ratio effectively. According to preferred 
embodiments, a reduction ratio is updated after each macroblock is processed. In other 

20 words, the blocks representing luminance and chrominance pixels associated with an 
MPEG-2 macroblock can be processed using the same reduction ratio. After the 
macroblock is processed however, the reduction ratio is updated based on be number of 
output bits associated with the altered transform coefficients of the processed 
macroblock and the number of input bits associated with the transform coefficients of 

25 the original macroblock. Rimning tallies of input bits associated transform coefficients 
of original macroblocks and output bits associated with altered transform coefficients 
of processed macroblocks can also be maintained to provide more sophisticated 
feedback information for updating the reduction ratio. A variety of factors can be used 
to update the reduction ratio. Generally, factors that can be used to update the 

30 reduction ratio are referred to as rate control information. 

According to various embodiments, rate control information can be updated on 
a per block basis. In other words, instead of waiting for the processing of multiple 
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blocks associated with a macroblock, the reduction ratio can be updated as soon as a 
single block of transform coefficients is altered. It still other embodiments, rate control 
information can be updated after several macroblocks are processed. The following 
equations detail the calculation of a reduction ratio performed on a per macroblock 
5 basis. 

Rt = Bt / Bi (Equation 1) 

where 

Rt is the target reduction ratio for the frame; 
10 Bi is the total input bit size of the frame; and 

Bt is the target output bit size of the frame. 

Equation 1 can be used to calculate the target reduction ratio. According to 
preferred embodiments, the target reduction ratio varies on a per frame basis. The 
15 target reduction ratio for I-frames is generally higher than the target reduction ratio for 
B-frames and P-frames. As noted above, I-frames are independent frames comprising 
the actual image information. B-frames and P-frames are dependent frames that are 
associated with motion vectors and differential information. 

20 Ru = (Rtbc - (bo -Rtbi))/bc (Equation 2) 

where 

Ru is the reduction ratio for the cvirrent macroblock; 
Rt is the target reduction ratio for the frame; 
be is the number of bits for the current macroblock; 
25 bo is the total number of bits output for the prior macroblocks in the frame; and 

bi is the total number of bits input for the prior macroblocks in the frame. 

Equation 2 provides a simple technique for calculating an actual reduction ratio 
that addresses deviations from the target reduction ratio. The value bi is a ruiming tally 
30 of the total number of input bits associated with transform coefficients of processed 
macroblocks in the frame. The value bo is a running tally of the total number of output 
bits associated with altered transform coefficients of processed data blocks in the 
frame. If the reduction ratio is being calculated for the first macroblock of the frame, 
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both bi and bo are zero. The value be is the number of bits for the current macroblock. 
The (bo-Rtbi) term indicates by how many bits cumulatively the target reduction ratio is 
being missed. For example, if the reduction ratio is 75 percent and the running tally of 
output bits is 80 and the running tally of input bits is 100, the term would indicate that 
5 the target reduction ratio is being missed by five bits. Equation 1 attempts to 
compensate for the missed by increasing or decrease in the reduction ratio for the next 
macroblock. However, fully compensating for the miss in the next macroblock may 
cause significant jittering of the reduction ratio between macroblocks. It is also likely 
that the target reduction ratio can not be achieved. 

10 

Ru = (RtBi - bo)/(Bi - bi) (Equation 3) 

where 

Ru is the reduction ratio for the current macroblock; 
Rt is the target reduction ratio for the frame; 
15 Bi is the number of input bits for the fi-ame; 

bo is the total number of bits output for the prior macroblocks in the frame; and 
bi is the total number of bits input for the prior macroblocks in the frame. 

Equation 3 allows compensation for the miss to be disbursed across all 
20 remaining macroblocks. For example, where the number of input bits for the frame is 
1000, the ruiming tally of output bits for the prior macroblocks is 80, the running tally 
of the input bits for the prior macroblocks is 100, and the target reduction ratio for the 
firame is 60 percent, the updated reduction ratio would be (1000*60% - 8 0)7(1000-1 00) 
= 520/900 = 57.78%. Equation 3 allows more gradual compensation for a target 
25 reduction ratio miss. Equation 3 provides relatively low variations in reduction ratios 
from macroblock to macroblock that jittering of the reduction ratio still occurs during 
the later macroblocks of the frame. Again, the target reduction ratio may be missed. 

Ru = (Rt(bi + W) - bo)/W (Equation 4) 

30 where 

W is a constant representing a predetermined number of bits; 
Ru is the reduction ratio for the current macroblock; 
Rtis the target reduction ratio for the frame; 
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bo is the total number of bits output for the prior macroblocks in the frame; and 
bi is the total number of bits input for the prior macroblocks in the frame. 



Equation 4 introduces a spreading factor W. the spreading factor represents a 
next group of bits. According to various embodiments, the spreading factor is 1/4 or 
1/8 of Bi or the total input bits of the frame. The spreading factor reduces jittering 
while increasing the probability that meeting the target reduction ratio. For example, 
where W is 25% of 1000 bits or 250, bi is 100, bo is 80, and Rt is 60%, the updated 
reduction ratio would be (60%(100 + 250) - 80)/250 or 52%. According to various 
embodiments, where W decreases, the reduction ratio increases. Similarly, where W 
increases, the reduction ratio decreases. The spreading factor reduces the likelihood of 
jitter during the processing of later macroblocks in a frame. However, it is still 
possible that he target reduction ratio will be missed. 

Ru = (Rt(bi + W) - bo + a( Rtb; - bo))/W (Equation 5) 



where 

a is a convergence factor; 

Ru is the reduction ratio for the current macroblock; 

Rt is the target reduction ratio for the frame; 

W is a constant representing a predetermined number of bits; 

bo is the total niunber of bits output for the prior macroblocks in the frame; and 

bi is the total number of bits input for the prior macroblocks in the frame. 

Equations 5 and 6 introduce a convergence factor. The convergence factor can 
be used to force overshooting of the target. For example, if the target reduction ratio is 
60 percent, W is 250 bits, bi is 100 bits, bo is 80 bits, and the convergence factor is /4, 
the updated reduction ratio would be 60% + ((1 + i/2)(60%*100 - 80))/250 = 48%. It 
should be noted that the convergence factor is used to overshoot the target reduction 
ratio. According to various embodiments, D. convergence factor is 1 or 1/2. 



= Rt + ((1 + a)(Rtbi - bo))/W 



(Equation 6) 



R„ - Rt + ((1 + a)(Rtbi - bo))/W + /d 



(Equation 7) 



where 
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is a per frame offset value; 
a is a convergence factor; 

Ru is the reduction ratio for the current macroblock; 
Rt is the target reduction ratio for the frame; 
5 W is a constant representing a predetermined number of bits; 

bo is the total number of bits output for the prior macroblocks in the frame; and 
bi is the total number of bits input for the prior macroblocks in the frame. 

Equation 7 introduces a compensation factor. The compensation factor allows 
10 calculation of filter is and quantization scales as shown in Figure 10 to be rough 
estimates. After the filter and quantization scales as shown in Figure 10 are applied to 
a particular frame type, and a compensation factor can be used to increase the accuracy 
of the reduction ratio calculation. A compensation factor is calculated using a prior 
compensation factor for the same frame type. For example, compensation factors are 
15 tracked for I-frames, B-frames, and P-frames. A compensation factor is adapted to 
both past performance as well as picture type as shown in Equation 8. 

/d =/d' +(Rt - Bo/Bi) (Equation 8) 

where 

20 is a previous offset for the same frame type. 

Rtis the target reduction ratio for the frame; 
Bi is the total input bit size of the frame; and 
Bo is the total output size of the frame. 

25 The techniques of the present invention for determining updated reduction 

ratios can be used in a variety of rescaling systems as well as encoding systems. The 
techniques are not limited to update a great reduction ratios on a per macroblock basis 
that can be applied to 18 other unit of data. Using the techniques of the present 
invention, variation in reduction ratios between macroblocks is reduced while 

30 achieving target reduction ratios. 

The present invention for altering transform coefficients to provide rate 
reduction in a bitstream can be implemented in various network systems. In various 
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embodiments, this is implemented in the headend of a high bandwidth networks such 
as a cable network or a satellite network. In the context of a cable network, the 
invention may be implemented in a standalone system, such as Cisco 6920 RateMux® 
available from Cisco Systems, Inc, or in a line card of a cable network headend such as 
5 the Cisco UBR 7200 also available from Cisco Systems, Inc. 

Figure 13 depicts the basic components of a cable modem headend that can be 
used to implement the present invention, according to specific embodiments. Although 
the techniques of the present invention can be integrated into a cable modem headend, 
10 the present invention can also be used in a standalone system. Figure 13 shows an 
implementation using the cable modem headend. 

A Data Network Interface 1 302 is an interface component between an external 
data source and the cable system. External data sources transmit data to data network 
15 interface 1302 via optical fiber, microwave link, satellite link, or through various other 
media. Also as mentioned above, a Media Access Control Block (MAC Block) 1304 
receives data packets from a Data Network Interface 1302 and encapsulates them with 
a MAC header. 

20 In a specific embodiment as shown in Figure 13, CMTS provides functions on 

three network layers including a physical layer 1332, a Media Access Control (MAC) 
layer 1330, and a network layer 1334. Generally, the physical layer is responsible for 
receiving and transmitting RF signals on the cable plant. Hardware portions of the 
physical layer include a downstream modulator and transmitter 1306 and an upstream 

25 demodulator and receiver 1314. The physical layer also includes software 1386 for 
driving the hardware components of the physical layer. 

Once an information packet is demodulated by the demodulator/receiver 1314, 
it is then passed to MAC layer 1330. A primary purpose of MAC layer 1330 is to 
30 encapsulate and decapsulate packets within a MAC header, preferably according to the 
above-mentioned DOCSIS standard for transmission of data or other information. 
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MAC layer 1330 includes a MAC hardware portion 1304 and a MAC software 
portion 1384, which function together to encapsulate information packets with the 
appropriate MAC address of the cable niodem(s) on the system. After the upstream 
information has been processed by MAC layer 1330, it is then passed to network layer 
5 1334. Network layer 1334 includes switching software 1382 for causing the upstream 
information packet to be switched to an appropriate data network interface on data 
network interface 1302. 

When a packet is received at the data network interface 1302 from an external 
10 source, the switching software within network layer 1334 passes the packet to MAC 
layer 1330. MAC block 1304 transmits information via a one-way communication 
medium to downstream modulator and transmitter 1306. Downstream modulator and 
transmitter 1306 takes the data (or other information) in a packet structure and converts 
it to modulated downstream frames, such as MPEG or ATM frames, on the 
15 downstream carrier using, for example, QAM modulation (other methods of 
modulation can be used such as CDMA (Code Division Multiple Access) OFDM 
(Orthogonal Frequency Division Multiplexing), FSK (FREQ Shift Keying)). The 
return data is likewise modulated using, for example, QAM 16 or QSPK. Data fi-om 
other services (e.g. television) is added at a combiner 1307. Converter 1308 converts 
20 the modulated RF electrical signals to optical signals that can be received and 
transmitted by a Fiber Node 1310 to the cable modem hub. 

It is to be noted that alternate embodiments of the CMTS (not shown) may not 
include network layer 1334. In such embodiments, a CMTS device may include only a 
25 physical layer and a MAC layer, which are responsible for modifying a packet 
according to the appropriate standard for transmission of information over a cable 
modem network. The network layer 1334 of these alternate embodiments of CMTS 
devices may be included, for example, as part of a conventional router for a packet- 
switched network. 

30 

In a specific embodiment, the network layer of the CMTS is configured as a 
cable line card coupled to a standard router that includes the physical layer 1332 and 
MAC layer 1330. The techniques of the present invention including a filtering stage 
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and rate control stage shown in Figure 1 can be implemented on a line card. Using this 
type of configuration, the CMTS is able to send and/or receive IP packets to and from 
the data network interface 1302 using switching software block 1382. The data 
network interface 1302 is an interface component between external data sources and 
5 the cable system. The external data sources transmit data to the data network interface 
1302 via, for example, optical fiber, microwave link, satellite Hnk, or through various 
media. The data network interface includes hardware and software for interfacing to 
various networks such as, for example, Ethernet, ATM, frame relay, etc. 

10 As shown in Figure 13, the CMTS includes a hardware block 1350 including 

one or more processors 1355 and memory 1357. These hardware components interact 
with software and other hardware portions of the various layers within the CMTS. 
Memory 1357 may include, for example, I/O memory (e.g. buffers), program memory, 
shared memory, etc. Hardware block 1350 may physically reside with the other CMTS 

15 components. 

In one embodiment, the software entities 1382, 1384, and 1386 are 
implemented as part of a network operating system running on hardware 1350. 
Further, the provisions of this invention for providing quality of service for multicast 
20 streams are preferably implemented in software as part of the operating system. 

Because such information and program instructions may be employed to 
implement the systems/methods described herein, the present invention relates to 
machine readable media that include program instructions, state information, etc. for 

25 performing various operations described herein. Examples of machine-readable media 
include, but are not limited to, magnetic media such as hard disks, floppy disks, and 
magnetic tape; optical media such as CD-ROM disks; magneto-optical media such as 
optical disks; and hardware devices that are specially configured to store and perform 
program instructions, such as read-only memory devices (ROM) and random access 

30 memory (RAM). The invention may also be embodied in a carrier wave travelling 
over an appropriate medium such as airwaves, optical lines, electric lines, etc. 
Examples of program instructions include both machine code, such as produced by a 
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compiler, and files containing higher level code that may be executed by the computer 
using an interpreter. 

While the invention has been particularly shown and described with reference 
5 to specific embodiments thereof, it will be understood by those skilled in the art that 
changes in the form and details of the disclosed embodiments may be made without 
departing from the spirit or scope of the invention. For example, the embodiments 
described above may be implemented using firmware, software, or hardware. 
Moreover, embodiments of the present invention may be employed with a variety of 

10 communication protocols and should not be restricted to the ones mentioned above. 
For example, the techniques for rescaling data can be implemented a variety of systems 
including a router, a line card of a CMTS, or a generic computer system. In addition 
and as mentioned above, the invention may be implemented in both differential and 
single-ended configurations. Therefore, the scope of the invention should be 

1 5 determined with reference to the appended claims. 
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